618 results found.
Written
Corpus,
Language Type:
Bilingual
Languages:
English German
Availability:
Freely Available
License:
Size:
450M sentences Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Learning Deep Transformer Models for Machine Translation
-
Paper track:Long/Machine Translation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Qiang Wang | WMT 2016 Translation Task Data | /N |
Documentation:
None
Written
Corpus,
Language Type:
Bilingual
Languages:
English German
Availability:
From Data Center(s)
License:
Size:
None Production Status:
Existing-used
Use:
-
Paper title:Neural Architectures for Nested NER through Linearization
-
Paper track:Short/Tagging, Chunking, Syntax and Parsing
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Jana Straková | CoNLL 2003 Shared Task Named Entity data | /N |
Documentation:
None
Written
Corpus,
Language Type:
Bilingual
Languages:
English German
Availability:
Freely Available
License:
Size:
2 GByte Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Retrieving Sequential Information for Non-Autoregressive Neural Machine Translation
-
Paper track:Long/Machine Translation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Chenze Shao | wmt14 | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
German
Availability:
Freely Available
License:
Size:
20,217 sentences Production Status:
Newly created-finished
Use:
Opinion Mining/Sentiment Analysis
-
Paper title:A Corpus for Argumentative Writing Support in German
-
Paper track:Long paper/
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Thiemo Wambsganss | Argumentation Annotated Student Peer Reviews Corpus | /N |
Documentation:
https://github.com/thiemowa/argumentative_student_peer_reviews/blob/main/guidlineV2.pdf, publicly available, in German
Written
Corpus,
Language Type:
Bilingual
Languages:
English German
Availability:
License:
Size:
None Production Status:
Existing-used
Use:
Named Entity Recognition
-
Paper title:Exploring Cross-sentence Contexts for Named Entity Recognition with BERT
-
Paper track:Long paper/
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Jouni Luoma | CoNLL 2003 Shared Task Named Entity data | /N |
Documentation:
None
Written
Tagger/Parser,
Language Type:
Multilingual
Languages:
Catalan Chinese Czech English German Spanish
Availability:
Freely Available
License:
CC BY-SA-NC 4.0
Size:
None Production Status:
Newly created-finished
Use:
Semantic Role Labeling
-
Paper title:Bridging the Gap in Multilingual Semantic Role Labeling: a Language-Agnostic Approach
-
Paper track:Long paper/
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Simone Conia | Multi-SRL (COLING 2020) | /N |
Documentation:
None
Written
Evaluation Data,
Language Type:
Multilingual
Languages:
Albanian Croatian English German Russian Turkish
Availability:
Freely Available
License:
Size:
10 MByte Production Status:
Existing-updated
Use:
Document Classification, Text categorisation
-
Paper title:XHate-999: Analyzing and Detecting Abusive Language Across Domains and Languages
-
Paper track:Long paper/
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Mladen Karan | XHate-999 | /N |
Documentation:
There is an accompanying paper detailing dataset createion as well as a short readme with technical details that accompanies the dataset.
Written
Corpus,
Language Type:
Bilingual
Languages:
English German
Availability:
Freely Available
License:
Size:
5,900,000 sentences Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Is MAP Decoding All You Need? The Inadequacy of the Mode in Neural Machine Translation
-
Paper track:Long paper/
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Bryan Eikema | WMT 2018 News Translation Task Data | /N |
Documentation:
None
Written
Corpus,
Language Type:
Bilingual
Languages:
English German
Availability:
Freely Available
License:
Creative Commons
Size:
4.5M sentences Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Dynamic Curriculum Learning for Low-Resource Neural Machine Translation
-
Paper track:Long paper/
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Chen Xu | WMT 2016 English-German | /N |
Documentation:
N/A
Written
Corpus,
Language Type:
Multilingual
Languages:
English French German Greek Italian Latin Spanish
Availability:
Freely Available
License:
Size:
1.55 GByte Production Status:
Existing-used
Use:
Language Identification
-
Paper title:Detecting de minimis Code-Switching in Historical German Books
-
Paper track:Short paper/
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Shijia Liu | Deutsches Textarchiv | /N |
Documentation:
None




